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Abstract 

Prior distributions elicited for modelling the natural fluctuations or the uncertainty 
("^ ' on parameters of Bayesian fishery population models, can be chosen among a vast range 

Cn ■ of statistical laws. Since the statistical framework is defined by observational processes, 

observational parameters enter into the estimation and must be considered random, sim- 
^.^^ , ilarly to parameters or states of interest like population levels or real catches. The former 

V^ . are thus perceived as nuisance parameters whose values are intrinsically linked to the 

considered experiment, which also require noninformative priors. In fishery research 
Jeffreys methodology has been presented by Millar (2002) as a practical way to elicit 
such priors. However they can present wrong properties in multiparameter contexts. 
Therefore we suggest to use the elicitation method proposed by Berger and Bernardo to 
P^ , avoid paradoxical results raised by Jeffreys priors. These benchmark priors are derived 

^0 ' here in the framework of sequential population analysis. 

■5 

^ ' La grande variete des lois a priori utiUsables pour refleter I'incertitude sur un parametre, 

au sein d'un modele de dynamique de population traite dans un cadre statistique bayesien, 
necessite de pouvoir beneficier d'une distribution de reference, dite neutre (non-informative), 
a laquelle les comparer, en particulier pour juger I'impact de la prise en compte d'une 
connaissance parfois subjective. Dans cette optique, Millar (2002) a calcule \'a priori de 
QQ ' Jeffreys, qui possede la propriete d'invariance par reparametrisation, pour differents 

QQ I modeles simples de population utilises en halieutique. Cependant, cet a priori presente 

fT^ . de mauvaises proprietes lorsque le nombre de parametres est grand. Dans cette note, 

l/^ ' nous suggerons d'utiliser \'a priori de reference de Berger-Bernardo pour les parametres 

de nuisance lies au processus d'observation, dans le cas oCi Ton modelise les variations 
^— V ■ de population par des modeles sequentiels de grande dimension. 

O' 



Introduction 

■ »~~i I 

j^ ■ From years Bayesian statistics have been recognized as a practical methodology allowing 

H \ to take account of natural uncertainties arising in biological population models. TJiey have 

been applied to a growing number of problems in fishery research (Punt and Hilborn 1997). 
In most applications a large part of parameters can benefit from expert knowledge, such 
that prior distributions can be elicited in order to increase the data information summa- 
rized in the likelihood of observations. It is essential, however, that prior elicitation must 
be carefully led about other parameters for which no information is available. 

To avoid integrating unduly subjective information into Bayesian inference, the need 
for noninformative priors in fishery research has been highlighted by Millar (2002). He 
reviewed formal rules to define and elicit such priors in the Bayesian world and proposed 
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using Jeffreys priors. Indeed, unlike the common but tricking flat priors which lead to para- 
doxes (Kass and Wasserman 1996), Jeffreys priors have the property to be independent on 
parametrization choices. 

In many cases this property appears desirable. For instance, when the scale of an initial 
population is unknown (since nonobservable) and is considered as a parameter to be esti- 
mated, any subjective prior (even diffuse) is expected to influence the posterior estimation 
not only of the scale but of many other important quantities directly linked to the scale, 
Hke the maximum sustainable yield, in ways predictable with difficulty. A scale-invariant 
prior can solve such issues. Therefore Millar (2002) derived Jeffreys prior for some com- 
mon surplus production models and biological parameters of sequential population mod- 
els. Eliciting noninformative priors has another interest: the Bayesian analyst can aim to 
understand the amount of subjective information used for an assessment, comparing sub- 
jective and objective posterior distributions. 

If Jeffreys prior is quite useful and practical for a single parameter, it can however suffer 
from strong deficiencies in multiparameter problems. Cases of posterior inconsistency have 
been highlighted by Berger and Bernardo (1992). When deriving Jeffreys prior for sequen- 
tial population models, Millar (2002) warned his readers about this knotty feature. Then 
reference priors were elicited to preserve the desirable invariance properties of the Jeffreys 
priors, but to avoid such paradoxical consequences (Koop et al. 2007, chap. 8). The term 
"reference priors" was used by Millar (2002) for referring to Jeffreys prior, but is used here 
according to the usual terminology of Bayesian statistics, referring to the elicitation method 
proposed originally by Bernardo (1979) then refined by Berger and Bernardo (1992). 

In facts, difficulties mainly occur when parameters are hierarchised. This was formally 
demonstrated by Ghosal (1997). In fishery research, the main issue is evaluating the pop- 
ulation level over the time steps. It requires to focus on real catch values to get absolute 
evaluations. In a classical statistical frame, population levels and catches can be viewed as 
parameters to estimate or hidden states of Markov chains whose trajectories must be recon- 
stituted. In the Bayesian framework these differences become blurred and we can simply 
define them as interest parameters and denote their vector as Oj. 

However, some parameters always enter into population models because of observa- 
tional processes. Since they are unknown, complicate the inference and because we have 
less interest in estimating them than estimating population levels and catches, they take the 
sense of nuisance parameters (Idler 2001, chap. 3). Typically, they are at least three: the vari- 
ance i/)^ arising from the direct observation of catches, the variance (f)^ linked to the indirect 
observation of survey indices and the global catchability q linking the real survey indices 
and the population levels. Note that other parameters, for instance linked to recruitment or 
selectivity, notably differ from dj^ = [ilP' ,(l)'^,q) since they arise from biological mechanisms 
or technical features that are independent on the observational experiments. As an exam- 
ple, selectivity parameters intrinsically characterize the state of a fishnet, and some external 
knowledge is often available to provide a prior range of most probable values (Harley and 
Myers 2001). Thus, 9^ differs from other parameter vectors since it remains relative to a 
given experiment, and its prior distribution with density tt{9n) has to be considered non- 
informative. 

Therefore we suggest in this note to adopt the reference prior approach for eliciting 
benchmark priors for 9n, which completes the results given by Millar (2002). In the follow- 
ing, a common sequential population model is defined through its observational equation. 
No hypothesis is made on the linkage between populations and catches (for instance Bara- 
nov equations or their Pope's approximation). Then we give the definition of reference 
priors and provide our elicited prior '!t{9n) for nuisance parameters. 



Sequential population model 

Sequential population models present numerous variants, for instance depending on how 
natural mortality parameters are build, assumed to be constant or varying in function of 
some age classes. However, the following formulation encompasses a variety of modellings 
(see for instance Chassot et al. 2009), especially cohort models. 



Survey information. For a = 1, . . . , A (age classes) and t = 1, . . . , T (time steps), let la^t 
denote a survey index such that la.t = qsaNa,t where Sa denotes the selectivity-by-age 
characterizing survey gears and Na.t is the abundance of the studied species. Correlated 
observational errors are assumed between ages (Myers and Cadigan 1995), following clas- 
sical lognormal distributions: denoting /* ^ the observed index, one assumes 

Ja,t = logCf = log/a,t+ea,t+?7t (1) 

where ea,t ^■~^ M{0,a^) and rjt '■^ A/'(0,r^). The survey likelihood then arises from the T 
equations 

A 

J*t = ^J:,, =J*+ Alogg + 4, i = l,...,T, (2) 

a=l 

where .% = Y.t=i log ^aTVa,*, e[ '^^ U{0, 0^) and cj)^ = Aa^ + AW- 

Catch information. Following Aanes et al. (2007), we adopt the following formulation 
for the variations of observed total catches C'^ over ages. Denote Ct the real total catch at 
time t. Then the catch likelihood arises from the T equations 

C; = CteM^t), t=l,...,T, (3) 

where ut ~ Af{~ip'^ /2,tp'^). The lognormal error lyt has mean — V-'^/2 and not to ensure 
E[Cj*] = Ct- Otherwise, estimation naturally underestimates the real catches, making worse 
the frequent bias due to underreported landings Cf (Bousquet et al. 2010). Such a correc- 
tion (usually called Laurent correction) is not useful in the survey equation ^ since the 
bias E [/*(]— /(J t is only reflected in the estimation of q (Aanes et al. 2007). 

A condensed parametrization of the full observation model is given by (6*/, 9^) where 
6 1 = (Ji, ..., Jt, log Ci, ..., log Ct). To be very general, one assttmes that an informative 
prior measure Tr{9j\6N) can always be elicited (possibly independent on 9^). As proposed 
by Follestad (2003), this prior can encompass environmental fluctuations /z^'j, i = 1,2 (com- 
monly gaussian), such that 

Na+l.t+1 = f^^HNa,uFa.t,Ma,t)cxp{tlf}) 

where the /'■*' are deterministic functions (such as well-known Baranov or Pope's equations 
when A > 1, or simpler surplus-production relations if a unique age class is considered) 
and (F, M) denote fishing and natural mortality rates, and external information on biolog- 
ical parametric functionals (like (F, Al)). 



Eliciting the nuisance prior 

We are here interested in eliciting the Berger-Bernardo reference prior ^(0^), indepen- 
dently from any prior choice on 6i . Reference priors differ from Jeffreys priors in the sense 
they are defined as the optimal priors according to an information-theoretical criterion, 
while Jeffreys priors emerge from a formal definition of a so-called ideal feature (invari- 
ance to reparametrization). 

To be clearer, denote /(xJ^tv) the density of observed data x (here the (C^*, J^*)), uncon- 
ditionally to 6i, and let 6^ takes its values in the metric space O. The reference prior is the 
unique maximizer of the Kullback-Leibler divergence between the posterior and the prior 
densities 



KL(7r|x) - / -,{^eN\^)\og^^^^ dON 



on average on data that are marginally sampled, i.e. x ~ m(x) = J^ /(x|0Ar)7r(07v) d6, and 
assuming finally asymptotic conditions. In summary the reference prior is defined as 

TT* = argmaxi lim E„ [KL(7r|X)] I (4) 

'^ [card(x)^oo J 

The Kullback-Leibler distance between posterior and prior measures the (relative) Shan- 
non's information quantity that is yielded by the data into the inference (independently of 
any parametrization); maximizing it asymptotically is similar to minimizing the quantity 
of information yielded by the prior independently on the number of data. Hence the prior 
weight is minimized on average into the posterior. Further details can be found in Cover 
and Thomas (2006). 

Note that asymptotism makes sense in our framework, since the dimension of 9^ does 
not increase with the number 2T of observed data. One would have to be careful in the elic- 
itation of a conditional reference prior tt{6i\6n) since the dimension of 9j always linearly 
increases with T, because we consider the catches as parameters (besides this invalidates 
any frequentist estimation theory over dj, see Bousquet et al. 2010). Some relations (like 
-F— constraints, cf . Gavaris and lanelli 2002) can often be established to diminish the dimen- 
sion of Oj. But anjrway our final result ^{Om) will not depend on any hypothesis placed on 
the intern covariance of 6i. Finally, we give the reference prior in next theorem. Its proof is 
given in Appendix. 



Theorem 1. The solution of the optimization problem @ is 



The elicited prior is clearly improper except if some constant values other than and 
oo can bound the parameters. Note that 7r*(?/'^) and 7r*(0^) differ significantly from the 
familiar Jeffreys priors that can be derived from Millar (2002): 

7r(?/; ) ex 'il}~ , 7r(0 ) (X (j)~ . 

Giving more importance to smaller values of observational variances, they transmit more 
information to the estimation of parameters of interest. This is not the case for tt* (q) which 
is found similar to the classic scale-invariant measure (Millar 2002). 



Remark. Such priors remain conjugate conditionally to the set of all random quantities 
and observations, generically denoted V. Indeed one has 



\ t—1 a—1 ' 



t=i 




Eloff — 

, ^qSaNa.t 
.a—1 



-'w - ^5l^^^^-j:[c:-af^ 



which favors using within-Gibbs algorithms in any posterior computation of quantity of 
interest. 



Conclusion 

In this note we offer a counterpart to the choice of parametrization-invariant Jeffreys pri- 
ors to take account of the variability of nuisance parameters (observational variances and 
global catchability) in sequential population models. Our prior is elicited to avoid poten- 
tial incoherences arising from Jeffreys prior in high-dimensional models, while other priors 
elicited using past knowledge (or more generally expert knowledge) are placed on parame- 
ters of interest. As a new entrance into the catalog of reference priors for fisheries models, 
this note fits in line with the wishes expressed by Millar (2002) in his seminal work. 
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Appendix: proof of Theorem 1 

We start from an asymptotic development of E,„[KL(7r|X)], under classic regularity condi- 
tions, provided by Ghosh and Mukerjee (1992) and recalled by Sun and Berger (1998, Equ. 
2). Recalling that the size of our observed data is 2T, we have (up to an additive constant) 

E„[KL(7r|X)] = ^logT- / ^ [Qn) log ^^^^ dO^ + o{l) (5) 



where 



7r'^(6'Ar) cx cxp<^ - 



{\Jq <(^i\(^N)\og^^deiY (6) 



(0/ being the parameter space for 9i), |S| is the determinant of the per observation Fisher 
information matrix 



no) 



IX 



dHogf{x\e) 

8989' 



and Y.2{6i\9f4) is this same matrix for 6*/, given that 6^ is held fixed. One has {X* being a 
generic notation for (C^*, J^)) 

1 

= -^(logQ-logQ), 
1 (logQ-logCQ^ 



a2iog/(x*|6i) 


aiog^Ct 

92log/(X*|0) 


9 log CtaV'2 

dHogf{x*\e) 


9(^2)2 

a2iog/(x*|0) 


9(^2)2 
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d''logf{X*\e) 
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9(logg)2 
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11/ ~^2 



■J [Jt -Alogq- Jt 



d log gi9 Jt 
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Any other cross derivative is zero. With 




Ex [log C*- log Q 


] = -V'V2, 


Ex 


{iogc:-iogCtf 


= ^2 + ^V4, 


Ex 


(j;-Alogq-Jt) 


= 0, 


Ex 


(j;-Alogq-Jt)' 


= 0^ 


one can write 









E = 



0-4/2 
(0) 



A(f>-^^a 



(0) 



with 



/ A 1 1 

' 1 1 



V 1 



(0) 



1 \ 

(0) 

1 J 



with dimension T + 1 and 



-1 2 

-1 2 

i (0) 



-1 

(0) 

2 



with dimension T + 1. Note that Sb = Q^Qb with Qi, the lower triangular matrix 



-I/V2 V2 


(0) 


; (0) 

V -1/V2 


V2 


|S| OC 0-(6+2T)^-2(T+2) (r_l)^2_2 




^ / r'/T \ 





Qb 



Thus, we have 



Finally, note that 



where It is the unit diagonal matrix of dimension T. Thus 
Since |I]|/|I]2| does not depend on 62, from l|6ll we have 



^^(v^<^Mogg) (X v\n/\^2i 



which can be rewritten (assuming of course T > 1) 



7r■^(v^0^g) ex 7r*(V'^0^g)(^-l)^ 



V'2(r-i) 



\ 



Denoting KL(7ri||7r2) the Kullback-Leibler divergence between densities tti and tt2, the 
asymptotic criterion ^ becomes (up to an additive constant) 



E„[KL(^|X)] = ^logr + log(r-l)-KL(^||^*) + i 



77(6* at) log 



?/;2(T-l) 



(IOn +o{l), 



^logr-Ki(7r||7r*) + i 



^(0jv)(l + O(r-i)) cIOn 



when T increases. Since the Kullback-Leibler divergence is always positive or null, one 
finally must choose tt = tt* to maximize this asymptotic expression. Indeed, KL(7r||7r*) = 
if and only if tt = tt* (cf. Cover and Thomas 2006). 



